An Automatic Annotation Technique for Web Search Results
نویسندگان
چکیده
The uses of web search engines are very frequent and common worldwide over the internet by end users for different purposes. A web search engine takes the query request from the end user and executes that query on relational database used to store the information on behalf of that web search engine. Based on input queries the dynamic response is generated by search engine, in the form of HTML based pages. Such pages are supported with the web databases. Every web page generated contains many results to display for particular query, called as Search Result Records (SRRs). Sometimes it becomes troublesome to extract relevant data from diverse sources. The SRRs generated may contain data units that are relevant to one common semantic. These SRRs are further required to be assigned with proper labels. The manual methods for record extraction and labeling have a worse scalability. Thus automatic annotation based method is needed to improve the accuracy as well as scalability of web search engines. This paper presents an automatic annotation technique for web search results. The proposed approach first aligns the data units on a result page into different groups such that the data in the same group have the same semantic. Then, each group is annotated from different aspects and aggregates the different annotations to predict a final annotation label for it. The annotation wrapper generated for the search site is automatically constructed and can be used to annotate new result pages from the same web database. Experiments indicate that
منابع مشابه
Fuzzy Neighbor Voting for Automatic Image Annotation
With quick development of digital images and the availability of imaging tools, massive amounts of images are created. Therefore, efficient management and suitable retrieval, especially by computers, is one of themost challenging fields in image processing. Automatic image annotation (AIA) or refers to attaching words, keywords or comments to an image or to a selected part of it. In this paper,...
متن کاملReview on Automatic Annotation of Query Results from Deep Web Database
In recent years, web database extraction and annotation has received much attention from the database and Information Extraction(IE) in research area due to the volume and quality of deep web. Many web databases are accessible through HTML formbased interface. When query is submitted to the search interface the query result page is generated. Search Result Records(SRRs) are the result pages obt...
متن کاملProbability Association Approach in Automatic Image Annotation
introduction Content-based image retrieval (CBIR) has wide applications in public life. Either from a static image database or from the Web, one can search for a specific image, generally browse to make an interactive choice, and search for a picture to go with a broad story or to illustrate a document. Although CBIR has been well studied, it is still a challenging problem to search for images ...
متن کاملWeb Image Search by Automatic Image Annotation and Translation
There has been a growing interest in implementing online Web image search engine in the semantic level. However, most existing practical systems including popular commercial Web image search engines like Google and Yahoo! are either textbased or a simple hybrid of texts and visual features. This paper proposes a novel Web Image Search by Automatic Image Annotation and Translation (WISAIAT) syst...
متن کاملAnnotation-Based Automatic Action Processing
With a strong motivational background in search engine optimization the amount of structured data on the web is growing rapidly. The main search engine providers are promising great increase in visibility through annotation of the web page’s content with the vocabulary of schema.org and thus providing it as structured data. But besides the usage by search engines the data can be used in various...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016